Second Language Acquisition from Aligned Corpora
نویسندگان
چکیده
The paper describes a system for automatic aligning and searching for translation equivalents in large bilingual corpora. This implementation was developed to facilitate our tasks in GLOSSER #343 Coper-nicus'94 Joint Research Project, where Linguistic Modeling Laboratory was charged especially with preparation of bilingual material. The Gale-Church algorithm is chosen as aligning procedure for parallel texts. The main functional characteristics of our system MARK ALISTeR (MARKing , ALIgning and Searching TRanslation equivalents) are described. The program package can be used under MS Windows as an autonomous procedure of second language acquisition and CALL instrument. Evaluation of the results and the alignment errors of the algorithm and the tool is presented for diierent types of texts.
منابع مشابه
Bilingual Terminology Acquisition from Comparable Corpora and Phrasal Translation to Cross-Language Information Retrieval
The present paper will seek to present an approach to bilingual lexicon extraction from non-aligned comparable corpora, phrasal translation as well as evaluations on Cross-Language Information Retrieval. A two-stages translation model is proposed for the acquisition of bilingual terminology from comparable corpora, disambiguation and selection of best translation alternatives according to their...
متن کاملThe TASX-environment: an XML-based toolset for time aligned speech corpora
This paper describes the design and implementation of an XML-based corpus environment for multi-tier annotated speech data. The TASX-environment (TASX: Time Aligned Signal data eXchange format) constitutes the technical basis for a corpus designed to explore the acquisition of prosody by second language learners. It supports all aspects of the corpus setup procedure: XML-based annotation of the...
متن کاملWord Sense Acquisition from Bilingual Comparable Corpora
Manually constructing an inventory of word senses has suffered from problems including high cost, arbitrary assignment of meaning to words, and mismatch to domains. To overcome these problems, we propose a method to assign word meaning from a bilingual comparable corpus and a bilingual dictionary. It clusters second-language translation equivalents of a first-language target word on the basis o...
متن کاملA Dynamical System Approach to Research in Second Language Acquisition
Epistemologically speaking, second language acquisition research (SLAR) might be reconsidered from a complex dynamical system view with interconnected aspects in the ecosystem of language acquisition. The present paper attempts to introduce the tenets of complex system theory and its application in SLAR. It has been suggested that the present dominant traditions in language acquisition research...
متن کاملAutomatic Translation Template Acquisition Based on Bilingual Structure Alignment
Knowledge acquisition is a bottleneck in machine translation and many NLP tasks. A method for automatically acquiring translation templates from bilingual corpora is proposed in this paper. Bilingual sentence pairs are first aligned in syntactic structure by combining a language parsing with a statistical bilingual language model. The alignment results are used to extract translation templates ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998